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Abstract — Cognitive Radio Networks allow unlicensed users to 
opportunistically access the licensed spectrum without causing 
disruptive interference to the primary users (PUs). One of the 
main challenges in CRNs is the ability to detect PU transmissions. 
Recent works have suggested the use of secondary user (SU) 
cooperation over individual sensing to improve sensing accuracy. 
In this paper, we consider a CRN consisting of a single PU 
and multiple SUs to study the problem of maximizing the total 
expected system throughput. We propose a Bayesian decision rule 
based algorithm to solve the problem optimally with a constant 
time complexity. To prioritize PU transmissions, we re-formulate 
the throughput maximization problem by adding a constraint 
on the PU throughput. The constrained optimization problem is 
shown to be NP-hard and solved via a greedy algorithm with 
pseudo-polynomial time complexity that achieves strictly greater 
than 1/2 of the optimal solution. We also investigate the case for 
which a constraint is put on the sensing time overhead, which 
limits the number of SUs that can participate in cooperative 
sensing. We reveal that the system throughput is monotonic over 
the number of SUs chosen for sensing. We illustrate the efficacy of 
the performance of our algorithms via a numerical investigation. 



I. Introduction 

Cognitive radio networks (CRNs) have been proposed to 
address the spectrum scarcity problem by allowing unlicensed 
users (secondary users, SUs) to access licensed spectrum on 
the condition of not disrupting the communication of licensed 
users (primary users, PUs). To this end, SUs sense licensed 
channels to detect the primary user (PU) activities and find 
the underutilized "white spaces". FCC has opened the TV 
bands for unlicensed access |2Q, and IEEE has formed a 
working group (IEEE 802.22 17J) to regulate the unlicensed 
access without interference. Many other organizations are 
also making efforts on the spectrum access policy in the 
CRN environment, e.g., DARPA's 'Next Generation' (XG) 
program ifTTl mandates cognitive radios to sense signals and 
prevent interference to existing military and civilian radio 
systems. To avoid the interference to PUs, sensing becomes 
an indispensable part of CRN design. 

Sensing can be performed via several methods, including 
energy detection, cyclostationary feature detection, and com- 
pressed sensing 0|. Energy detection is a simple method 
and requires no a priori knowledge of PU signals 11211 . Its 

' The recent FCC luling requires the use of central TV Band usage databases 
to verify spectral availability. While respecting this ruling, our work explores 
local cooperative methods to improve sensing accuracy with the potential 
outcome of relieving this burdensome requirement. 



main disadvantage is its decreased accuracy in face of fading, 
shadowing, and unknown noise power profiles. For instance, 
if an SU suffers from shadowing or heavy fading, the sensed 
signal tends to be weak while the PU is transmitting, leading 
to incorrect decisions. To address these problems while main- 
taining sensing simplicity, cooperative sensing schemes that 
fuse the sensing results of multiple SUs have been proposed 

siniiiTi. 

Cooperative sensing overcomes shortcomings of individual 
sensing results by jointly processing observations. SUs in a 
locality report their individual sensing results, which are then 
used in a predefined decision rule to optimize an objective 
function. Examples of such functions include maximizing 
sensing accuracy (generally, a function of false alarm proba- 
bility and mis-detection probability) or maximizing the system 
throughput. Aside from maximizing sensing accuracy related 
metrics, cooperative sensing schemes are also designed to 
estimate the maximum transmit power for SUs so that they 
do not cause disruptive interference to PUs ilU . On the other 
hand, cooperative sensing incurs additional sensing delay viz 
a viz individual sensing. 

Three main categories of decision rules have been identified 
in IHl: Soft Combining, Quantized Soft Combining, and Hard 
Combining. In the first two categories, the sensing results are 
sent to the fusion center with little or no processing, while 
in the last one, binary local decisions are usually reported. 
Similar to sensor networks, linear fusion rules are widely 
applied to achieve a cooperative decision, such as AND, OR 
and majority rules lfT4l . In addition, a more advanced fusion 
technique that utilizes statistical knowledge ||20| has been 
devised to capture the correlation between SUs in cooperative 
sensing. However, the resulting algorithm is suboptimal and 
its approximation factor is unknown. None of the above- 
mentioned works identify optimal decision rules for general 
decision structures and they require decision rules to assume 
particular forms (e.g., linear) for optimality analysis. 

In this paper, we design an optimal data fusion rule to (hard) 
combining of the reported sensing result. More specifically, we 
aim to maximize the system throughput in a CRN composed 
of a single PU (i.e., single channel) and several SUs. While the 
target system is a simplified one, it is helpful in revealing the 
challenges associated with the design of optimal fusion rules. 
Moreover, the resulting algorithms can easily be generalized 
to more complex systems comprised of multiple channels, 
where sensing decisions are made per channel. Our main 
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contributions can be summarized as follows: 

• In contrast to previous works that restrict the class of 
fusion rules, we propose a Bayesian decision rule based 
algorithm to solve the throughput maximization problem 
optimally with constant time complexity. 

• To guarantee resources for the PU, we re-formulate the 
problem by adding a constraint on the PU throughput. 
This constrained problem is shown to be NP-hard by 
reducing the classical partition problem [i6J to it. A 
greedy algorithm is obtained with pseudo-polynomial 
time complexity. This approximation algorithm is ana- 
lytically shown to achieve strictly greater than 1/2 of the 
optimal solution. 

• We investigate systems where limited sensing overhead 
is allowed, i.e., the number of sensing SUs is restricted. 
Our theoretical results show that the performance of 
cooperative sensing is monotonic over the number of SUs 
used for sensing. However, the characterization of the 
upper bound on the required number of sensing nodes 
remains elusive. 

The paper is organized as follows: Related work is presented 
in Section HI] In Section Hill the system model is introduced. 
The system throughput maximization problem is formulated 
in Section IIVI and solved optimally via Bayesian decision 
rule. In Section |V] the constrained maximization problem 
is formulated, which is shown to be NP-hard. A pseudo- 
polynomial time greedy algorithm is proposed with an ap- 
proximation factor strictly greater than 1/2. Another direction 
is considered in Section |VT] where the system throughput is 
maximized subject to a constraint on the number of sensing 
SUs used. In Section IVIII numerical results are presented for 
the performance of our algorithms. The paper is concluded in 
Section IVnl] 

II. Related Work 

Cooperative sensing solutions have been investigated in 
recent years. They rely on multiple SUs to exchange sensing 
results or a central controller to collect the sensing results from 
SUs. The network is usually divided into clusters and each 
cluster head makes the decision on the channel occupancy. 
Collaborations among SUs have been shown to improve the 
efficiency of spectrum access and allow the relaxation of 
constraints at individual SUs |l3l ll24ll . One branch of the papers 
in cooperative sensing assume that the length of sensing time 
at individual SUs is proportional to the sensing accuracy. 
However, longer sensing time decreases the transmission time. 
The trade-off is called the sensing efficiency problem and 
is discussed in UOI and 1121 . In our work, we assume the 
observation time at each SU is fixed so that the individual 
sensing accuracy does not depend on it. We focus on the 
optimal decision rule based on the sensing results collected. 

Decision rules so far mainly focus on AND, OR, majority 
rules and other linear rules. Zhang et. al. [(25l show that the op- 
timal fusion rule to minimize the cooperative sensing error rate 
is the half-voting rule in most cases. They show that AND or 
OR rules are optimal only in rare cases. However, other rules 




Fig. 1. System model of an SU network overlayed with a PU networl;. 

with more complicated forms have not been considered in ||25]| . 
Based on these observations, a fast spectrum sensing algorithm 
is proposed for a large network where not all SUs are required 
for sensing while satisfying a given error bound. However, the 
optimal number of sensing nodes and the complexity of this 
problem have not been discussed. In |[T4l , the SU throughput 
is maximized subject to sufficient protection provided to PUs. 
The optimal fc-out-of-A^ fusion rule is determined and the 
sensing/throughput trade-off is also analyzed. As in IZSll . 
no fusion rules of general forms are considered. Thus, the 
optimization is restricted to a small fusion rule domain. Shahid 
et. al. 0^ consider the spatial variation of SUs and the fusion 
rule is a weighted combination of SU observations. The weight 
depends on the received power and path loss at each SU. 
Though more advanced than AND, OR, and majority rules, 
the weighted form is restricted to the linear function domain. 
In m, optimal multi-channel cooperative sensing algorithms 
are considered to maximize the SU throughput subject to per 
channel detection probability constraints. The resulting non- 
convex problem is solved by an iterative algorithm. Compared 
to m, our work focuses on the maximization of the system 
throughput, including the PUs and SUs. Although we only 
consider a single-channel network, which is a simplification 
made on the model, our decision algorithms can be applied 
for each channel individually. Moreover, a soft decision rule 
is considered in |4|, which requires significant amount of data 
to be transmitted to the coordinator while our hard decision 
rule requires only one bit sent from each SU. 

III. System Model 

We consider a time-slotted cognitive radio network in which 
a PU network, consisting of a PU base station (PU-BS) and 
PU receivers, co-exists in the same area with an SU base 
station (SU-BS) and M SUs (Figure [TJ- We focus on the PU 
transmissions over a particular channel. We consider uplink 
part for the SU system, i.e., only one SU can be active and 
transmit to the SU-BS at any given time. Some PU receivers 
may lie in the interference range of SUs such as PU 1 in 
Figure [T] Any transmission from these SUs such as SUs 1, 
2, and 3 in Figure [T] may cause interference to those PU 
receivers. We denote the set of SUs whose uplink transmission 
causes interference to PU receivers by S and \S\ = 
(M > N). They are indexed from 1 to N. SUs outside S can 
use the channel to transmit at any time slot without causing 
interference to the PUs. 

SUs in S are close to the PU network and they may sense the 
channel cooperatively to reduce the sensing errors. The sensing 
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Fig. 2. Control slot Tc and data slot T^. 

results of individual SUs are assumed to be independent. Let B 
represent the PU activity such that i? = 1 if PU is active, and 
B = otherwise. Let Pj denote the probability of a false 
alarm for SU i, which is the probability that SU i senses 
the PU to be active given that the PU is actually idle, 
represents the probability of mis-detection for SU i, which 
is the probability that SU i senses the PU to be idle given that 
the PU is actually active. 

Cooperative Sensing: Multiple SUs are chosen to sense the 
channel and the SU-BS predicts the PU activity by collecting 
the sensing results from these SUs. We denote the set of SUs 
that participate in the cooperative sensing as 5*0, where l^ol = 
k. Note that 5*0 C S. In the cooperative sensing model, we 
assume the SU-BS collects sensing results from SUs in 5*0. 

Cooperative Sensing Indicator; The observation of the 
PU activity by SU i is denoted by o^. = 1 indicates that 
SU i observes the PU to be active, while Oi ~ indicates 
that SU i observes the PU to be idle. In this paper, our 
objective is to characterize 5*0 and estimate the PU activity 
based on observations from 5*0 (called the decision rule). The 
decision rule is denoted as a function / : — > 17 where 

= {0, 1}. The observations form a vector o, where o G fi*^ 
while the decision is denoted by O where O G fl. The 
false alarm probability of cooperative sensing is denoted by 
Pf = P{0 = l\B = 0). The mis-detection probability of 
cooperative sensing is denoted by P,';^ = P{0 = 0\B = 1). 
One time slot is divided into a control slot Tc and a data slot 
Td where Tc + Ta = 1 (Figure O. In the control slot, the 
SU-BS collects sensing results from So and notifies an SU 
in S if the cooperative sensing result is "idle" {O = 0). If 
the PU is active (mis-detection), the PU transmission will be 
collided with the transmission from the SU. The length Tc 
of the control slot is regarded as the sensing overhead and 
assumed to be constant throughout the paper It means that a 
fixed time period is allocated for cooperative sensing in each 
slot. 

The uplinks of SUs in S are assumed to have the same 
capacity which is normalized to 1. If the decision of the 
cooperative sensing at the SU-BS is "idle", the SU-BS notifies 
one of the SUs in S (not limited to 5*0, the sensing set) to 
transmit. We assume SUs in S are always backlogged. The 
scheduling of the transmitting SUs is beyond the scope of 
this paper However, any work-conserving scheduling policy 
operating on idle slots can be used together with the decision 
rule to maximize the total system throughput. We let ttq denote 
the probability that the PU is idle and we assume that the prior 
distribution of PU activity is acquired over time accurately. 



TABLE I 

Notation List 



Symbol 


Meaning 


M 


Total number of SUs in the secondary network 


S 


Set of SUs wfiich cause interference to PU receivers 


N 


\s\ 


So 


Set of SUs that are chosen to sense the channel. So Q S 


k 


\So\ 




False alarm probability of SU i 


pL 


Mis-detection probability of SU i 


Pf 


False alarm probability of cooperative sensing 


PL 


Mis-detection probability of cooperative sensing 


To 


Control slot 


Ta 


Data slot 




Probability that the PU is idle 


7 


Average throughput of PUs in the interference range of a SU 



Note that we do not restrict the PU activity to any specific 
distribution except that it does not change within one time slot. 
The average throughput of PUs whose transmission would be 
interfered by SUs in S is denoted as 7. Table H] summarizes 
the notations used in the paper 

The SU communication follows a protocol with the follow- 
ing outline: 

1) SUs report P^'s and Pfs to SU-BS; 

2) SU-BS determines the sensing set Sq and the decision 
rule / based on Pys and the optimization metric; 

3) SU-BS notifies SUs in Sq with an ACK and also assigns 
each one of them a SEQ number for reporting sensing results; 

4) SUs receiving an ACK sense the channel and report the 
results to SU-BS in the order of SEQ; 

5) SU-BS makes the decision of the PU activity based on 
the sensing results and / and schedules an SU for transmission 
if the decision is (PU idle). 

IV. System Throughput Maximization 

In this section, we formulate the cooperative sensing prob- 
lem with the assumption that So = S, that is, the sensing 
results from all SUs in S are reported to SU-BS within Tc- 
SUs outside S can transmit without causing interference to the 
PUs. Thus, their performance is independent of the choice of 
the sensing set or the decision rule. Our goal is to maximize 
the sum of the expected throughput of SUs in S and that of 
the PUs whose transmission may be interfered by the SUs. 
It is equivalent to maximizing the expected throughput of the 
system with PU-SU co-existence. 

A. Problem Formulation 

Given 5 = (the PU is idle), the probability of a particular 
observation vector o occurring is 

p(oii?=o)= n ^/ n (1-^/)- 

ies,oi=i jes,oj=o 
The sum of all P{o\B = 0)'s with /(o) = is given as 

P(O = 0|B = 0)= ^ P(o|B = 0). (2) 

/(o)=0 
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Algorithm 1 Bayesian Decision Rule Based Algoritlim for maximizing 
tlie system thu'oughput (given o, decide O) 

1: if (1 - Tc)7ro 11 

Oi — l 



then 

O 
else 

O ^ 1 



n p; n (1 - pi) > 7 n (1 - n p?n 



ma.x L{B = 0,0 = 1) 



Then, the false alarm probability of cooperative sensing is 

Pf = 1- P{0 = 0\B = 0) = 1- J2 P{o\B^O). (3) 

/(o)=0 

Likewise, given B = 1 (the PU is active), the probabiUty 
of a particular observation vector o occurring is 

p(oii3=i)= n (i-o n 

ies,Oi=i jes,Oj=o 
The sum of all P{o\B = l)'s with /(o) = 1 is given as 

P(0 = 1|S = 1)= ^ P(o|B = l). (5) 

/(o) = l 

Then, the mis-detection probability of cooperative sensing 

is 

P,-„ = 1-P(0 = 1|P = 1) = 1- Pio\B = l). (6) 

f{o) = l 

Note that Equation ^ is the conditional probability that 
SU-BS correctly identifies the PU activity when it is idle so 
that one SU could transmit successfully; Equation (|5]) is the 
conditional probability that SU-BS correctly detects the PU is 
active so that no SU would transmit and the PU could transmit 
successfully. Accordingly, the expected throughput of the SUs 
can be represented by 

(1 - rc)P(P = 0, O = 0) = (1 - Tc)TraP{0 = 0|P = 0) 

= (l-TeKo ^(o|S = 0), (7) 

/(o)=0 

since the uplinks of SUs in S are assumed to have capacity 
1 and only one of them could be scheduled in each time slot. 
The expected throughput of the PU can be represented by 

7P(0 = 1|P = 1) = 7 J2 Pio\B = l) (8) 

/(o) = l 

since 7 is the average throughput of the PU whose transmis- 
sion would be interfered by SUs in S. The problem is then 
formulated as follows: 
Problem (A): 

max(l-rc)7ro ^ P(o|P = 0) + 7 ^ P(o|P = 1) 

^ /(o)=0 /(o) = l 

B. Optimal Solution with Bayesian Decision Rule 

We show that Problem (A) can be converted to a Bayesian 
Decision problem. Algorithm [T] is then proposed based on 
Bayesian decision rule to minimize the posterior expected loss 
IT] and it is of constant time complexity. 

Problem (A) is equivalent to Problem (|9]) in terms of optimal 

/• 



E Pio\B^O) 



/(o) = l 



-L{B = 1,0 = 0) 



{l-TTo) E P(0\B=1) 
f(o)=0 



(9) 



where L{B, O) is the loss of decision O based on observation 
o, which is a negative number L{B = 0, O = 1) = — (1 — Tc) 
and L{B = 1,0 = 0) = -j^- Thus Equation Q is the 
posterior expected loss of decision O (Definition 8 of Chapter 
4.4 in HI). Using the Bayesian decision rule. Problem ^ 
can be solved optimally HI: given o, the decision O = 1 
if |L(P = 0,0 = l)|7roP(o|P = 0) < \L{B = 1,0 = 
0)1(1 - 7ro)P(o|P = 1) and O = otherwise. Algorithm □ is 
designed accordingly. 

V. Guaranteeing a Target PU Throughput 

In this section, we investigate the maximum throughput 
problem with a PU throughput constraint. With higher priority, 
a minimum PU throughput is guaranteed in the problem 
formulation. We first show that this constrained problem is 
NP-hard by reducing the classical partition problem t6,| to 
it. Then a greedy approximation algorithm is proposed to 
achieve strictly greater than 1/2 of the optimal solution. The 
complexity of the algorithm is shown to be pseudo-polynomial 
by solving a two-dimensional dynamic programming problem. 

A. Problem Formulation and Properties 

We formulate the constrained optimization problem as fol- 
lows: 

Problem (B): 

max(l-Tc)7ro ^ -P(o|P = 0) + 7 ^ P{o\B = l) 

^ fio)=Q /(o) = l 

s.t. E Pio\P = !)>"■ (10) 

Equation ( fTOl i is the constraint we put on Problem (B) 
where the expected PU throughput must be no less than a 
preset system-dependent threshold. It is equivalent to 1 — 

E n (1-^™) n P^, <l-«,wherel-ais 

f{o) = lieS,Oi = l j(ES,Oj=0 

the collision factor. This can be interpreted as the probability 
that a PU transmission colliding with an SU transmission being 
no greater than 1 — a. Problem (B) maximizes the expected 
system throughput given that the lowest PU throughput can be 
met considering the high priority of the PU in cognitive radio 
networks. 

By observing the structure of Problem (B), we state 
Lemma ISTI that shows the optimal assignment of observations 
with G(o) < H{o) where G(o) = (l-Tc)7roP(o|P = 0) and 
H{o) = 'yP{o\B = 1). We define /* as the optimal solution 
to Problem (B). 

Lemma 5.1: In the optimal solution to Problem (B), we 
have f*{o) = 1 for all G(o) < H{o). 

Proof: (Prove by contradiction) Assume that /*(o) = 
for some o where G(o) < H{o). Moving it from O = to 
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C A 

0=0 0=1 

Fig. 3. Assignment of observations with no constraint and the optimal 
assignment for Problem (B). A + B + C is the optimal assignment with 
no constraint while A + B' + C \& the optimal solution to Problem (B). 

= 1 increases ^ P{o\B = 1) so that this operation still 

/(o) = l 

makes a feasible solution. Furthermore, the expected system 
throughput increases considering G(o) < H{o), which makes 
a better solution than the current optimal one. It causes a 
contradiction. Hence, we have /*(o) = 1 for all G(o) < H{o) 
in the optimal solution to Problem (B). ■ 
With the property of Lemma 15.11 we only need to decide 
which observations with G(o) > H{o) should be put in 
O = 1 to solve Problem (B) optimally. We define x = 
{o : G(o) > H{o) and /*(o) = 1}, which is the set of 
observations that need to be moved to O = 1 in the optimal 
solution; ip = {o : G(o) > H{o) and /*(o) = 0}, which 
is the set of observations that stay in O = in the optimal 
solution. Moreover, we define A = ^ H{o), which 

o:G{o}<H{o) 

is the contribution of observations with G(o) < H{o) in the 
optimal solution; B ^ J2 G{o), which is the contribution 

of observations in x when put in O — 0; B' — ^ H{o), 

which is the contribution of observations in x when put in 
O = 1 (B > B'); C ^ J2 G{o), which is the contribution 

of observations in -0 when put in O = 0; G' = ^ H{o), 

oeip 

which is the contribution of observations in ijj when put in 
O = 1 (G > C). Then, A + B + G is the optimal solution to 
Problem (B) without the PU throughput constraint; A + B' + C 
is the optimal solution to Problem (B), which is no greater than 
A + B + C. The optimal assignment is illustrated in Figure |3] 

B. Proof of NP-hardness 

We focus on identifying the hardness of deciding observa- 
tions with G(o) > H{o) that should be put in O = 1 in this 
section. It is shown to be NP-hard by reducing the classical 
partition problem |i6J to the subproblem of it in Theorem 15.21 

Theorem 5.2: Problem (B) is NP-hard. 
Proof: We first state the classical partition problem |6l 
- Given N positive integers: yi, • • • , i/n, is there a way to 
have them partitioned into two equal-sized subsets that have 
the same sum? For reduction, we construct an instance of 
Problem (B) by setting (1 - rc)7ro = 7, + P} < 1, 
a = e + J2 H{o) with e < min H{o). 

o:G(o)<H(o) o:G(o)>H(o) 

For this instance, putting any o with G(o) > H{o) to 
= 1 would make a feasible solution given that observations 
with G(o) < H{o) have all been put in O = 1. Obvi- 
ously, choosing the observation with minimum non-negative 
G(o) — H{o) would be the optimal solution. Note that 
G(o) — H{o) = is equivalent to log = 0. By setting 



Algorithm 2 Greedy Approximation Algorithm for Problein (B) 
Input: N, Ta, ttq, 7, a, P^, P) for all i 
Output: / or "infeasible" 

1: G(o) 4- (1 - Tc)nQ n Pj n (1 - Pi) for all o 

ies,o;=i jes,oj=o 

2: H{o) ^7 n (1 - -Pm) n pL for all o 
3: U^J:h(o) 

o 

4: if C/ < a X 7 then 

5: output "infeasible" and return 

6: /(o) 4- 1 for all o, sural <— Yl ^^{o) 

o:G{o)<H(o) 

1: if suml > a X 7 then 

8: /(o) = for all o with G(o) > H{o) and return 
9: Sort o's with G(o) > H{o) in non-increasing order of and label 

them from 1 to I 

10: sum2 ^ 

11: for i = 1 to i do 

12: if sum2 + H(oi) > U — a X then break 
13: sum2 <— sum2 + H{oi), f{oi) <— 



log = - log = Vt for all i, we have log = 

2 Vi + J2 ^Vj- Now the instance be- 

Oi=0,i=l,--- ,N Oj = l,j = l,--- ,N 

comes: given N pairs of integers {yi, ~yi), • • • , (yjv, -Vn), 
exactly one number should be chosen from each pair; with this 
constraint, what is the minimum non-negative sum? The re- 
duction from the partition problem to this instance of Problem 
(B) can be done in polynomial time. 

To verify the correctness of the reduction, we can check: 
if the minimum non-negative G(o) — H{o) is 0, that is, the 
optimal solution of the instance is 0, we can answer "Yes" to 
the partition problem; if it is positive, we can answer "No" 
to the partition problem. If Problem (B) can be solved in 
polynomial time, then the partition problem can be solved in 
polynomial time as well. The partition problem is well-known 
to be NP-complete IS). Assuming P^^NP, Problem (B) has 
been proven to be NP-hard. ■ 

It has been shown in Theorem 15.21 that finding the obser- 
vation with G(o) closest to H{o) from above is NP-hard. 
Hence, it is unlikely to find an efficient algorithm to solve 
Problem (B) optimally. We will focus on the approximation 
algorithm design in Section IV-CI 

C. Greedy Approximation Algorithm 

We propose a greedy algorithm (Algorithm |2]i that initially 
assigns all observations to O = 1 and then moves observations 
with G(o) > H{o) by from the highest to lowest to O = 
until the feasibility constraint of Problem (B) is violated. By 
transforming Problem (B) into the Knapsack Problem 1231 . 
we will show that the algorithm achieves strictly greater than 
1/2 of the optimal solution for Problem (B), which is A + 
B' + G in Figure [3] Although the sum of G(o) or H{o) 
in the worst case has exponential number of terms, we will 
design a pseudo-polynomial time algorithm in Section IV-DI 
considering its combinatorial nature. Ignoring rounding errors, 
the implementation calculates these sums accurately. 

In Algorithm [2] observations are chosen by from the 
highest to the lowest and assigned to O = after those with 
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G{o) < H{o) are assigned to O = 1. Ties are broken by 
putting observations with smaller H{o) in the front. In Line 3, 
U is assigned to be the sum of contributions of all observations 
if put in O = 1 (A + B' + C'). In Lines 4-5, whether a feasible 
solution exists for the given input is checked by comparing 
the extreme case where all observations are assigned to O = 1 
with the threshold a x 7. In Line 6, observations are initialized 
to O = 1. Lines 6-8 checks whether the feasibility constraint in 
Problem (B) has been satisfied under the initial assignment. If 
yes, observations with G(o) > H{o) are assigned to O = by 
Bayesian decision rule. Lines 9-13 searches for observations 
with G(o) > H{o) from the highest to lowest until 

sum2 + H{oi) < L/ — a X 7 is violated (Line 12). Note 
that J2 H{o) < L/ - a X 7 and J2 H{o) > a x 

o:/(o)=0 o:/(o) = l 

7 (feasibility constraint) are equivalent since '}2iH{o) = U. 

o 

f{o) of these observations are set to be (Line 13) in the 
searching process. Next, we state Theorem 15.31 that gives the 
approximation factor of Algorithm |2l 

Theorem 5.3: Algorithmic achieves strictly greater than 1/2 
of the optimal solution to Problem (B). 

Proof: Recall the following notations: A = 
E H{o), B = EGio), B' = EHio), 

o:G(o)<H(o) oex oex 

C = E G(o), C ^ J2 H{o), andW = A + B + C. 
We define U = A + B'°Tc" = Y.H{o). Let APX be the 

o 

solution to Problem (B) output by Algorithm [21 Let OPT be 
the optimal solution to Problem (B). Then, we have 

OPT = W -{B -B') (11) 
= A + C + B' = {A + B' + C') + {C-C') = U+{C-C'). 

Equation (fTTT i holds by Figure [5] since moving observations 
from O = to O = 1 loses B — B' in throughput compared 
to the optimal solution with no constraint, which is W. Note 
that (C — C") is the optimal solution to Problem ( fT2] i and it is 
the difference of contribution to throughput between keeping 
observations with G{o) > H{o) and /*(o) = in O = and 
moving them to O = 1. 

max ^ {G{oi) ~ H{oi))xi 

G(oi)>H(oi).i=l,--- ,1 

s.t. ^ H{oi)xi <U -ax^ (12) 

G(oi)>H{oi),i=l,--- d 

Xi e {0, 1} for all i, 

where observations with G(o) > H{o) are labeled in an 
arbitrary order Next, we will show that the constraint of 
Problem ( fT2b and that of Problem (B) are equivalent. As shown 
in Figure [3] we have 

A + B'>aX'y^A + B'-aX'y>0 

^A + B' + C'-ax-/>C'<=>C'<U~ax-f. (13) 

Problem (fT2] | is a Knapsack Problem ||23| . By the fol- 
lowing greedy approach, at least 1/2 of (C — C) can be 
achieved 1231 : choosing observations with G{oi) > H{oi) 



Algorithm 3 Pseudo-Polynomial Algorithm to Find the Joint Distribution 

oi(\og§^y\ogH{o)) 

Input: N, Pj. p4 for all i 
Output: C{N, j, j') for all j, j' 



Vi 



round{log ■ ^ , r) X 10'" for all i 



Zi <— round(log ^_pi 1 1") X 10*^ for all i 

Xi <— round{logP^,r) X 10'' for all i 

fii <— rcmnd{log (1 — P^),r) X IC for all i 

N N 
M <- J2 nia.x {yi,Zi}, m -i^ J2 niin {yi , Zi} 

i = l i = l 

AT 

M' <— max {max Ai , max/ii}, m' <— min {\i,fii} 

» » i=i 

C{i,j,j') for all i, j, j', C(l,yi,Xi) ^ 1, C(l,2i,Ati) ^ 1 
for i = 1 to AT - 1 do 
for j = m to M do 

for j' = m' to M' do C{i + = C(i,j - yi+i,j' - 

Xi+i) + C{i,j - Zi+i,j' - fii+i) 



from the highest '^''°h{o^)°^'' '■^^ lowest until (fT2] | is 
violated, which is exactly what we do in Algorithm [2] since 



G(o,- 



-H{o,) 



if and only if ^ > ^ 



Hence, APX = U + 1/2{C - C) holds. Since > 0, we 
always have APX /OPT > 1/2 for Problem (B). ■ 
So far, we have shown that the greedy approximation algo- 
rithm (Algorithm 121) exists for Problem (B) with an approxi- 
mation factor strictly greater than 1/2. When U ^ C — C , 
this factor could be arbitrarily close to 1. 

D. Pseudo-Polynomial Implementation 

In Lines 3, 6, 9 and 12 of Algorithm |2l exponential 
number of observations are involved in the worst case due 
to its combinatorial nature. We design a pseudo-polynomial 
time algorithm by means of dynamic programming for the 
implementation. The running time of a pseudo-polynomial 
time algorithm is polynomial in the numeric value of the 
input, which is exponential in the length of them assuming 
they are rational numbers [23]. For simplicity, we assume 
(1 — Tc)7ro = 7 in Algorithmic] which can though be extended 
to general cases without the assumption easily. 

In Algorithm l3] dynamic programming is applied to cal- 
culate the joint distribution of log and log iJ(o), which 

counts the number of observations with the same log ^frr\ and 

^ H{o) 

the same log i7(o). Note that this algorithm does not require 
future information - only the collection of all sensing results 
from SUs in the current time slot is required. Lines 3, 6, 
9 and 12 of Algorithm 121 can be calculated based on these 
counts. round{a, r) rounds a to r decimal places. We use 
round{a, r) x 10^ to scale and round a real a to an integer. 
The rounding error will be discussed in the simulation. M 
and m specify the maximum and minimum contribution an 
observation o can have to log respectively, while M' and 
m' specify the maximum and minimum contribution an obser- 
vation o can have to log H{o) respectively. C{N, j, j') records 
the number of observations with log (after rounding) 
equal to j and log H{o) (after rounding) equal to j'. Boundary 
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conditions are set in Line 7. Lines 8-10 use iterations to 
find C{i,j,j') for all i = I,-- - ,A^, m < j < M and 
m' < j' < M' . The recursive function in Line 10 matches 
the fact that when the observation from SU i + 1 is added to 
observations from SU 1 to i, log is added by yij^i and 
log H{o) is added by Aj+i; on the other hand, when the 1 
observation from SU i + 1 is added to observations from SU 
1 to i, log is added by z^+i and log_ff(o) is added by 
Hi+i. Note that Line 10 may encounter C{i,j,j') beyond the 
boundaries of j or j', the value of which will be treated as 0. 
The time complexity is 0{N{M — m){M' — ra')), which is 
pseudo-polynomial. 

After the C{N,j,j') distribution is found. Lines 3, 6, 9 
and 12 of Algorithm |2] can be calculated accordingly, the time 
complexity of which is dominated by 0{N{M — m){M' — 
m')). 

VI. Sensing Set Identification 

In this section, we formulate a new problem where the SU- 
BS is free to choose any subset of S as the sensing set and 
maximizes the expected throughput of the system. We define 
d as the homogeneous reporting delay of the sensing results 
from an SU to the SU-BS, and d as the miscellaneous delay 
which covers all the processing required after the collection 
of sensing results at the SU-BS in Tc- No matter how many 
SUs are chosen in the sensing set, we always allocate the 
length of Tc as the control slot. Thus, the control overhead 
is still a constant in this section. Two cases are considered: 
Nd + d < Tc, where all SUs are allowed in the sensing set 
(Section NTM : Nd + d > Tc, where at most k = L^^J 
SUs are allowed in the sensing set (Section |VI-Bt . We show 
that the system throughput is monotonic over the number of 
SUs chosen in So in Proposition 16.11 The hardness of the 
constrained problem with Nd + d > Tc is unknown and there 
is no efficient algorithm proposed for this type of problem so 
far. 

A. Nd + d< Tc 

Without the constraint of the number of SUs in Sq, we will 
show that the full set gives the most information. First, we 
define 

p(o^"iB=o)= n ^/ n (1-^/)' (14) 

which is the probability of a particular observation vector o^" 
where So is the sensing set occurring given B = 0, and 

p{o^"\B^i)^ n (i-/^^j n (15) 

which is the probability of a particular observation vector o^'" 
where So is the sensing set occurring given B = 1. Then we 
formulate the problem as follows: 
Problem (C): 

max (1 - Tc)tto V P(o'^« \B = 0) 

/(o)=0 

f{o) = l 



Proposition 6.1: Let F*{So) be the optimal solution to 
Problem (C) with 5*0 fixed. Then 

F*{{ti,---,tk,ik+i})>F*{{ii,---,ik}). (16) 

Proof: (sketch) By adding the fc+l-th SU into the sensing 
set, we can at least achieve the same system throughput as 
before by ignoring its observation. The detailed proof can be 
found in our technical report |[T6l . ■ 

Using Proposition 16.11 we will prove it is the best choice 
to choose the full set as the sensing set in Corollary 16.21 

Corollary 6.2: For all D C S,we have F*{S) > F*{D). 
Proof: Given Z? C 5, we index the elements in S\D from 
the smallest to the largest as Zi, • • • , where m = |5'\£'| and 
< TO < iV. By Proposition O we have F*{D) < F*{DU 
{h}) < F*{DU{h,l2}) < •■• < F*{DU{h,--- ,lm}) = 

F*{S). m 

By Corollarv 16.21 Problem (B) can be solved by first setting 
Sq = S and then applying Algorithm [T| to find the optimal 
decision rule. Note that the time complexity is still 0(1). 

B. Nd + d> Tc 

We also investigate the case where the number of SUs in So 
is constrained, and state that it is unlikely to have an efficient 
algorithm to find the optimal solution. By Proposition 16. II the 
problem can be formulated as follows: 

Problem (D): 

max (1 - Tc)Tro V P(o^o \B = 0) 

f-)3o — ^ 

/(o)=0 

+7 P{o^"\B = 1) 

/(o) = l 

s.t. 1 5*0 1 — k, 

where k = L^^J- 

It has been shown in ifTsll that no non-exhaustive search 
method over the subset of S of size k can always solve it op- 
timally when observations are correlated. For the independent 
observation problem such as Problem (D), however, it is not 
clear whether exhaustive search would be necessary as shown 
in 1221 . Many heuristics such as Sequential Forward Selection 
(SFS, IH), Sequential Backward Selection (SBS, [19]) and 
their variations H have been proposed to solve problems of 
this type. Although we characterize the monotonic property of 
system throughput over the number of SUs in the sensing set, 
the complexity of the problem is not clear in the case when 
Tc is small, compared to N. 

VII. Simulations 

In this section, simulation results are presented for the 
performance of solutions proposed for Problems (A), (B), 
(C) and (D). We first compare the performance of Bayesian 
decision rule (Algorithm [T]i, majority, AND and OR policies 
12511 in Section IVII-AI Then the performance of the greedy 
algorithm for Problem (B) (Algorithm|2]i, the random selection 
and the optimal solution are presented in Section IVII-BI The 
performance of Sequential Forward Selection (SFS, lT9l ) is 
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A' = 10 = 0.2 TTO = 0.4 



Tc = 0.2 IT,, = 0.4 -, = 2 



1\ = 10 Jc = 0.2 TTo U.4 7 = 2 a = O.fS 



-s- Bayesian 




-•- Majority 




AND 




— OR 











1 1.2 1.4 1.6 1.8 2 2.2 2.4 2.6 2.8 

7 (PU throughput) 

(a) With different 7's. 




N (number of SUs) 

(b) With different Af's. 



Fig. 4. Performance comparison of Bayesian decision rule, majority, AND 
and OR. 

compared with the optimal solution to Problem (D) in Sec- 
tion [VI^Cl In all simulation studies, we consider a cognitive 
radio network with N = 10, = 0.2, ttq = 0.4, and 7 = 2. 
For each parameter setting, we generate 30 groups of -P^'s 
and Pys randomly, which represents the random geographical 
locations of SUs in a CRN. 

A. Performance of Bayesian Decision rule 

Algorithm [T] the Bayesian decision rule based algorithm, 
has been proven to be optimal in Section |IV] In Figure |4] 
we demonstrate the increase from majority, AND, OR rules 
in terms of system throughput, which is the objective function 
value of Problem (A). In majority rule, the decision is 1 only 
when the majority of the SUs sense an active PU; in AND 
rule, the decision is 1 only when all SUs sense an active PU; 
in OR rule, the decision is 1 if any of the SUs senses an 
active PU. We vary 7, the average PU throughput, and N, 
the number of SUs, respectively. Among all four algorithms, 
Bayesian decision rule strictly outperforms the other three. 
Among them, the OR rule is better then AND and majority 
rules since the PU transmission is better protected by the OR 
rule. OR rule is too conservative to guarantee SU transmission. 

B. Performance of Greedy Algorithm 

Greedy algorithm (Algorithmic can achieve strictly greater 
than 1/2 of the optimal solution to Problem (B), as shown in 
Section |V] We compare its performance with that of random 
selection. Random selection is also based on Bayesian decision 
rule, which means Algorithm [T] is first executed; after that, 
observations with G{o) > H{o) are randomly selected to 
put in O = 1 until the feasibility is satisfied. Thus the main 
difference between greedy algorithm and random selection lies 
in the selection criterion of observations with G(o) > H{o) 
after the initial assignment based on Bayesian decision rule. 

In addition, we set a = 0.8, and r = 2. We vary parameters 
such as 7, the average PU throughput, a, the PU throughput 
constraint, N, the number of SUs, and r, the decimal places 
kept in Algorithm [3] in Figures |5] and |6] respectively. To show 
the approximation factor of our algorithm accurately, two 
boundary cases are excluded in the result presentation where 
both greedy algorithm and random selection will give the 
optimal solution: 1) Bayesian decision rule gives the optimal 
solution; 2) It is optimal to put all observations in O = 1. 
Hence, we only show their performance when at least one but 




r (decimal places) 

Fig. 6. Performance comparison of greedy algorithm over different r's. 



not all observations with G{o) > H{o) have to be moved to 
= 1. The average case and worst case performances are 
calculated based on the results after the exclusion. 

In Figure |5(a)[ the approximation factors of greedy algo- 
rithm and random selection over the optimal solution are com- 
pared over different values of 7, the average PU throughput 
in the system. With a higher 7, the factor decreases gradually 
in both average and worst cases of Algorithm [2] and it is 
the same with random selection although it fluctuates a bit 
due to the random selection. Greedy algorithm outperforms 
random selection in both average and worst cases. Potentially, 
the Bayesian decision rule assigns more SUs to O = 1 
compared to a lower 7 case. Thus the initial assignment is 
closer to a, the PU throughput constraint. Since we only 
consider cases where Bayesian decision rule is not optimal, 
both algorithms tend to have worse performance when the 
initial assignment approaches a because it gets more sensitive 
to a wrong observation selection. 

In Figure |5(b)[ we vary a, the PU throughput constraint, 
and compare the performance of greedy algorithm and random 
selection. Greedy algorithm is obviously better than random 
selection in both average and worst cases. Furthermore, the 
worst performance of all random runs generated in greedy 
algorithm wins over the average performance in random 
selection. Approximation factors in both of them increase, 
although it is minor in greedy algorithm. The increase can be 
explained similarly to that in Figure |5(a)[ a higher a makes 
the initial Bayesian decision assignment farther away from 
it so that the performance is less sensitive to the choice of 
observations. Due to its randomness, random selection may 
have poor performance with the factor as low as about 0.4 in 
our simulation. 

We test the performance of greedy algorithm with different 
scales of the network. The number of SUs is varied from 5 
to 10. The approximation factors of both algorithms degrade 
with more SUs. However, the factor is always far above 1/2, 
as proved in Theorem 15. 3 1 The random selection drops below 
1/2 in some cases as shown in the figure. 

In Algorithm [3] we use r as the decimal places kept for 
the calculations. Although the rounding error is not strictly 
characterized in Section III, we show the improvement of 
performance with higher r, which means higher resolution, 
in Figure |6] Note that the worst case performance is always 
greater than 1/2. The average performance increases from 
about 0.7 to about 0.95, which is promising. 
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A' = in T, = n.2 770 = 0.4 Q = 0.8 ( 



.Y = 10 = 0.2 TTn = 0.4 7 = 2 r = 2 
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-e- Greedy average 
—•—Greedy worst case 
-A- Random average 
Random worst case 




(PU throughput) 

(a) With different 7's. 



0.8 0.82 0.84 0.86 0.88 0.9 0.92 0.94 0.96 
a fconstraint) 




(b) With different a's. 
Fig. 5. Performance comparison of greedy algorithm and random selection. 



N (number of SUs) 

(c) With different A^'s. 



= 0.2 ra = 0.4 -, = 2 




123456789 10 
k (number ul SUs in the sensing set) 

Fig. 7. Performance comparison of SFS over different fc's. 
C. Performance of Sequential Forward Selection 

With Nd + d< Tc, Problem (C) is the same as Problem (A) 
in that the optimal sensing set is the full set (Corollary I6.2l l. 
Thus, we focus on the performance of SFS, a heuristic for 
Problem (D) whose hardness is unknown so far. In SFS, we 
start from an empty sensing set. In every step, only the SU 
that is not yet chosen and has the largest marginal increase 
to the system throughput is added to the set. The algorithm 
stops when the size of the set reaches k. In Figure |7] we vary 
k, the size of the sensing set, from 1 to and show the 
approximation factor of SFS compared to the optimal solution 
to Problem (D). When k increases, the performance of SFS 
degrades until k ~ N where all SUs are chosen in the sensing 
set so that the order of selection does not matter. SFS on 
average achieves at least 0.8 of the optimal solution in our 
simulation although the factor is lower than 0.6 in one of the 
worst cases. 

VIII. CONCLUSION 

In this paper, we propose a series of algorithms to maximize 
the system throughput by cooperative sensing in cognitive 
radio networks. Bayesian decision rule is applied to solve 
the unconstrained optimization problem optimally. With the 
PU throughput constraint, the new problem is shown to be 
NP-hard and a greedy approximation algorithm with pseudo- 
polynomial time complexity is proposed. More importantly, 
the approximation factor is strictly greater than 1/2. By 
restricting the number of SUs chosen for sensing, a new 
constrained optimization problem is formulated. We present a 
structural property that more SUs lead to better performance. 
However, the characterization of the combinatorial problem 
remains elusive, which is our future work. Moreover, we are 
also interested in investigating cases where the observations 
of SUs are correlated. 
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